Interestingness and Pruning of Mined Patterns

Authors

  • Devavrat Shah
  • Laks V. S. Lakshmanan
  • Krithi Ramamritham
  • S. Sudarshan
Abstract

We study the following question: when can a mined pattern, which may be an association, a correlation, a ratio rule, or any other, be regarded as interesting? Previous approaches to answering this question have been largely numeric. Specifically, we show that the presence of some rules may make others redundant, and therefore uninteresting. We articulate these principles and formalize them in the form of pruning rules. Pruning rules, when applied to a collection of mined patterns, can be used to eliminate redundant ones. As a concrete instance, we apply our pruning rules to association rules/positive association rules derived from a census database, and demonstrate that significant pruning results.
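
To make the idea of one rule rendering another redundant more concrete, here is a minimal sketch in Python. The abstract does not state the paper's actual pruning rules, so the redundancy test below is an assumption chosen only for illustration: a rule is treated as redundant when another rule with the same consequent has a strictly smaller antecedent and at least the same confidence. The `Rule` type, the `is_redundant` and `prune` helpers, and the census-style items are all hypothetical.

```python
from typing import FrozenSet, List, NamedTuple


class Rule(NamedTuple):
    """A mined association rule: antecedent -> consequent (hypothetical type)."""
    antecedent: FrozenSet[str]
    consequent: FrozenSet[str]
    confidence: float


def is_redundant(rule: Rule, others: List[Rule]) -> bool:
    # Assumed criterion (not taken from the paper): a more general rule,
    # i.e. one with a strictly smaller antecedent and the same consequent,
    # already holds with at least the same confidence.
    return any(
        o.consequent == rule.consequent
        and o.antecedent < rule.antecedent  # proper subset, so a rule never prunes itself
        and o.confidence >= rule.confidence
        for o in others
    )


def prune(rules: List[Rule]) -> List[Rule]:
    # Keep only the rules that no other rule in the collection makes redundant.
    return [r for r in rules if not is_redundant(r, rules)]


# Made-up census-style rules, purely for illustration.
rules = [
    Rule(frozenset({"married"}), frozenset({"owns_home"}), 0.80),
    Rule(frozenset({"married", "employed"}), frozenset({"owns_home"}), 0.75),
    Rule(frozenset({"age_over_60"}), frozenset({"retired"}), 0.90),
]
kept = prune(rules)
for r in kept:
    print(sorted(r.antecedent), "->", sorted(r.consequent), r.confidence)
```

In this toy collection the second rule is dropped because the first one is more general and at least as confident; the other two survive. A real pruning rule could use a different test, and nothing here should be read as the authors' formalization.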


Similar resources

Locating previously unknown patterns in data-mining results: a dual data- and knowledge-mining method

BACKGROUND Data mining can be utilized to automate analysis of substantial amounts of data produced in many organizations. However, data mining produces large numbers of rules and patterns, many of which are not useful. Existing methods for pruning uninteresting patterns have only begun to automate the knowledge acquisition step (which is required for subjective measures of interestingness), he...


Pruning based interestingness of mined classification patterns

Classification is an important problem in data mining. Decision tree induction is one of the most common techniques applied to solve the classification problem. Many decision tree induction algorithms have been proposed based on different attribute selection and pruning strategies. Although the patterns induced by decision trees are easy to interpret and comprehend compared to the patte...


What Is Interesting: Studies on Interestingness in Knowledge Discovery

Knowledge Discovery in Databases (KDD) was defined by [FPSS96a] as “[. . . ] the non-trivial process of identifying valid, novel, potentially useful and ultimately understandable patterns in data.” As the size of databases increases, the number of patterns mined from them also increases. This number can easily increase to an extent that overwhelms users. To address this problem, patterns need t...


Mining Approximate Functional Dependencies from Databases Based on Minimal Cover and Equivalent Classes

Data Mining (DM) represents the process of extracting interesting and previously unknown knowledge from data. Approximate Functional Dependencies (AFD) mined from database relations represent potentially interesting patterns and have proven to be useful for various tasks like feature selection for classification, query optimization and query rewriting. The discovery of AFDs still remains under ...


Interestingness of Discovered Association Rules in Terms of Neighborhood-Based Unexpectedness

One of the central problems in knowledge discovery is the development of good measures of interestingness of discovered patterns. With such measures, a user needs to manually examine only the more interesting rules, instead of each of a large number of mined rules. Previous proposals of such measures include rule templates, minimal rule cover, actionability, and unexpectedness in the statistica...




Publication date: 1999